'twazn me!!! ;(' Automatic Authorship Analysis of Micro-Blogging Messages
Abstract
In this paper we propose a set of stylistic markers for automatically attributing authorship to micro-blogging messages. The proposed markers include highly personal and idiosyncratic editing options, such as ‘emoticons’, interjections, punctuation, abbreviations and other low-level features. We evaluate the ability of these features to help discriminate the authorship of Twitter messages among three authors. For that purpose, we train SVM classifiers to learn stylometric models for each author based on different combinations of the groups of stylistic features that we propose. Results show relatively good performance in attributing authorship of micro-blogging messages (F = 0.63) using this set of features, even when training the classifiers with as few as 60 examples from each author (F = 0.54). Additionally, we conclude that emoticons are the most discriminating features in these groups.
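To make the idea of low-level stylistic markers concrete, the following is a minimal sketch of how such counts might be extracted from a single message. The regular expressions, the interjection list, and the function name `stylistic_features` are illustrative assumptions, not the feature definitions used in the paper; the resulting counts are the kind of vector one could pass to an SVM classifier.

```python
import re

# Hypothetical marker patterns (assumptions for illustration only;
# the paper's actual feature definitions are not given in this abstract).
EMOTICON = re.compile(r"[:;=8xX][\-o']?[()\[\]DPpOo/\\|*]")
PUNCT_RUN = re.compile(r"[!?.]{2,}")      # e.g. "!!!", "??", "..."
ELONGATED = re.compile(r"(\w)\1{2,}")     # a character repeated 3+ times
INTERJECTIONS = {"haha", "lol", "hmm", "wow", "ugh", "yay"}


def stylistic_features(message: str) -> dict:
    """Count a few crude stylistic markers in one short message."""
    tokens = re.findall(r"[a-z']+", message.lower())
    return {
        "emoticons": len(EMOTICON.findall(message)),
        "punct_runs": len(PUNCT_RUN.findall(message)),
        "elongated": len(ELONGATED.findall(message)),
        "interjections": sum(t in INTERJECTIONS for t in tokens),
    }


if __name__ == "__main__":
    print(stylistic_features("'twazn me!!! ;("))
```

The patterns are deliberately crude (for instance, the emoticon pattern can also match ordinary character sequences such as "xD" inside words), so a real system would need a more careful inventory of markers.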
Similar resources
Sentiment Analysis of Microblogs
In this project we attempt to perform sentiment-based classification of micro-blogs using machine learning techniques. Sentiment analysis of short messages posted on micro-blogging tools can be helpful in determining the current usability and acceptance of any target product or service. It can help in raising alarms in the wake of sudden shifts in user sentiments or attitude towards the service...
Liao, Yang, Masud Moshtaghi, Bo Han, Shanika Karunasekera, Ramamohanarao Kotagiri, Timothy Baldwin, Aaron Harwood and Philippa Pattison (to appear) Mining Micro-Blogs: Opportunities and Challenges, In Ajith Abraham and Aboul Ella Hassanien (eds.) Social Networks: Computational Aspects and Mining, Springer
This chapter investigates whether and how micro-messaging technologies such as Twitter messages can be harnessed to obtain valuable information. The interesting characteristics of micro-blogging services, such as being user oriented, provide opportunities for different applications to use the content of these sites to their advantage. However, the same characteristics become the weakness of the...
Posting with credibility in Micro-blogging systems using Digital Signatures and Watermarks: A case study on Twitter
Micro-blogs are contemporary broadcasting services for exchanging small elements of content, including video and images. Despite its popularity, micro-blogging is not without issues. So far, various security concerns, such as the privacy and confidentiality of micro-blogging systems, have attracted the interest of the scientific community. Nevertheless, in this document we refer to a security issu...
Identifying Automatic Posting Systems in Microblogs
In this paper we study the problem of identifying systems that automatically inject non-personal messages in micro-blogging message streams, thus potentially biasing results of certain information extraction procedures, such as opinion-mining and trend analysis. We also study several classes of features, namely features based on the time of posting, the client used to post, the presence of link...
Analyzing user behavior of the micro-blogging website Sinaweibo during hot social events
The spread and resonance of users’ opinions on SinaWeibo, the most popular micro-blogging website in China, are tremendously influential, having significantly affected the processes of many real-world hot social events. We select 21 hot events that were widely discussed on SinaWeibo in 2011, and do some statistical analyses. Our main findings are that (i) male users are more likely to be invol...